Textually Summarising Incomplete Data
نویسندگان
چکیده
Many data-to-text NLG systems work with data sets which are incomplete, ie some of the data is missing. We have worked with data journalists to understand how they describe incomplete data, and are building NLG algorithms based on these insights. A pilot evaluation showed mixed results, and highlighted several areas where we need to improve our system.
منابع مشابه
On the Macroeconomics of Uncertainty and Incomplete Markets
Presidential address for the Twelfth World Congress of the International Economic Association, summarising semi-formally the author's recent work and concerns. Uncertainty and incomplete markets breed demand volatility as well as price and wage rigidities. The conjunction of these leads to multiple, volatile supply-constrained equilibria, typically reflecting coordination failures and apt to di...
متن کاملResilient Blocks for Summarising Distributed Data
Summarising distributed data is a central routine for parallel programming, lying at the core of widely used frameworks such as the map/reduce paradigm. In the IoT context it is even more crucial, being a privileged mean to allow long-range interactions: in fact, summarising is needed to avoid data explosion in each computational unit. We introduce a new algorithm for dynamic summarising of dis...
متن کاملSummarising Unreliable Data
Unreliable data is present in datasets, and is either ignored, acknowledged ad hoc, or undetected. This paper discusses data quality issues with a potential framework in mind to deal with them. Such a framework should be applied within data-to-text systems at the generation of text rather than being an afterthought. This paper also shows ways to express uncertainty through language and World He...
متن کاملAutomatic summarising: The state of the art
This paper reviews research on automatic summarising in the last decade. This work has grown, stimulated by technology and by evaluation programmes. The paper uses several frameworks to organise the review, for summarising itself, for the factors affecting summarising, for systems,
متن کاملA New Architecture for Summarising Time Series Data
This paper presents a new architecture for summarising complex time series data, in which four main components together with a knowledge base and a database are integrated. Based on the architecture, a knowledge-based text generation system has been implemented and its main functions are briefly explained in the context of a sample of data. Evaluation of the system has been done and some conclu...
متن کامل